Binary classification with ambiguous training data
Similar resources
Coverage-performance estimation for classification with ambiguous data
Classifier tradeoffs between accuracy and specificity are often analyzed with receiver operating characteristic (ROC) curves. Here we study a related analysis of the data in terms of coverage–performance curves (CPC) which more clearly indicate the presence of ambiguous data in classification problems with overlapping class distributions. We show that feedforward mapping networks are well suited to derive suc...
A Classification Scheme for Applications with Ambiguous Data
We propose a scheme for pattern classification in applications which include ambiguous data, that is, where patterns occupy overlapping areas in the feature space. Such situations frequently occur with noisy data and/or where some features are unknown. We demonstrate that it is advantageous to first detect those ambiguous areas with the help of training data and then to re-classify those data i...
A Performance Measure for Classification with Ambiguous Data
Real-world data can be difficult to classify due to overlapping classes of ambiguous data. One solution to this problem is to leave out data before classifying, while another solution is to first classify the data and then prune those results which are ambiguous. However, a problem exists in determining which data are ambiguous. In this paper we propose a performance criterion which gives a prec...
Using SVM for Classification in Datasets with Ambiguous Data
One of the challenges in machine learning is the classification of datasets with ambiguous instances. In this paper we study specifically datasets with examples that have overlapping feature values for different classes. In these circumstances there is a bound on the classification performance. While there seems to be a race for accuracy, very little has been done to understand and solve the is...
Simple Classification using Binary Data
Binary, or one-bit, representations of data arise naturally in many applications, and are appealing in both hardware implementations and algorithm design. In this work, we study the problem of data classification from binary data and propose a framework with low computation and resource costs. We illustrate the utility of the proposed approach through stylized and realistic numerical experiment...
Journal
Journal title: Machine Learning
Year: 2020
ISSN: 0885-6125,1573-0565
DOI: 10.1007/s10994-020-05915-2